Online clustering via finite mixtures of Dirichlet and minimum message length
نویسندگان
چکیده
This paper presents an online algorithm for mixture model-based clustering. Mixture modeling is the problem of identifying and modeling components in a given set of data. The online algorithm is based on unsupervised learning of finite Dirichlet mixtures and a stochastic approach for estimates updating. For the selection of the number of clusters, we use the minimum message length (MML) approach. The proposed method is validated by synthetic data and by an application concerning the dynamic summarization of image databases. r 2006 Elsevier Ltd. All rights reserved.
منابع مشابه
Positive Data Clustering Using Finite Inverted Dirichlet Mixture Models
Positive Data Clustering Using Finite Inverted Dirichlet Mixture Models Taoufik BDIRI In this thesis we present an unsupervised algorithm for learning finite mixture models from multivariate positive data. Indeed, this kind of data appears naturally in many applications, yet it has not been adequately addressed in the past. This mixture model is based on the inverted Dirichlet distribution, whi...
متن کاملMML-Based Approach for Finite Dirichlet Mixture Estimation and Selection
This paper proposes an unsupervised algorithm for learning a finite Dirichlet mixture model. An important part of the unsupervised learning problem is determining the number of clusters which best describe the data. We consider here the application of the Minimum Message length (MML) principle to determine the number of clusters. The Model is compared with results obtained by other selection cr...
متن کاملPositive Data Clustering based on Generalized Inverted Dirichlet Mixture Model
Positive Data Clustering based on Generalized Inverted Dirichlet Mixture Model Mohamed Al Mashrgy, Ph.D. Concordia University, 2015 Recent advances in processing and networking capabilities of computers have caused an accumulation of immense amounts of multimodal multimedia data (image, text, video). These data are generally presented as high-dimensional vectors of features. The availability of...
متن کاملAnalytical D’Alembert Series Solution for Multi-Layered One-Dimensional Elastic Wave Propagation with the Use of General Dirichlet Series
A general initial-boundary value problem of one-dimensional transient wave propagation in a multi-layered elastic medium due to arbitrary boundary or interface excitations (either prescribed tractions or displacements) is considered. Laplace transformation technique is utilised and the Laplace transform inversion is facilitated via an unconventional method, where the expansion of complex-valued...
متن کاملOn the Inference of Dirichlet Mixture Priors for Protein Sequence Comparison
Dirichlet mixtures provide an elegant formalism for constructing and evaluating protein multiple sequence alignments. Their use requires the inference of Dirichlet mixture priors from curated sets of accurately aligned sequences. This article addresses two questions relevant to such inference: of how many components should a Dirichlet mixture consist, and how may a maximum-likelihood mixture be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Eng. Appl. of AI
دوره 19 شماره
صفحات -
تاریخ انتشار 2006